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RELATED APPLICATION(S) 

This application is a continuation claiming priority to International Application 
5 No. PCT/CA98/01 133, filed December 7, 1998 (designating the U.S.), which claims 
priority to Canadian Patent No. 2,218,199, filed December 9, 1997, the entire teachings 
of both of which are incorporated herein by reference. 

BACKGROUND OF THE INVENTION 
Field of the Invention 

10 The invention relates to PAB II gene, and its uses thereof for the diagnosis, 

prognosis and treatment of a disease related with protein accumulation in nucleus, such 
as oculopharyngeal muscular dystrophy. 



Description of Prior Art - 

Autosomal dominant oculopharyngeal muscular dy$rpphy(OPMD) is an adult- 

1 U 
onset disease with a world-wide distribution. It usually presents itself in the sixth 

decade with progressive swallowiife difficulties (dys-phagia), eye lid drooping (ptosis) 

and proximal limb weakness. Unique nuclear filament inclusions in skeletal muscle 

fibers are its pathological hallmark (uome, F.M.S. & Fardeau, Acta Neuropath. 49, 85- 

87 (1980)). Using the full power of lmkage analysis in eleven French Canadian families, 
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vaff* e oculopharyngeal muscular dystrophy gene was fine mapped on human chromosome 
Ijflfyr 14 OB™ 55 et al *> 1997 > Neuromuscu&r Disorders 7 (Suppl. l):S70-74). A region of .75 
cM was thereby identified as a regicm containing the potential and unknown OMPD 
gene (Brais et al., 1997, supra), Un&rtunately, the OMPD gene has yet to be isolated 
5 and its nucleic acid or protein sequeirce have yet to be cribbed. 

It would be highly desirable to be provided with a tool for the diagnosis, 
prognosis and treatment of a disease related with polyalanine accumulation in the 
nucleus, such as observed in oculopharyngeal muscular dystrophy. 

SUMMARY OF THE INVENTION 
10 One aim of the present invention is to provide a tool for the diagnosis, prognosis 

and treatment of a disease related with polyalanine accumulation in nucleus, such as 
oculopharyngeal muscular dystrophy. 

Herein, the poly(A)%nding protein II (PAB II) gene was isolated from a 217 kb 
"candidate interval in chromosome 14qll.A (GCG)6 repeat encoding a polyalanine tract 
located at the N-terminus of thl protein was expanded to (GCG)8-13 in the 144 OPMD 
families screened. More severe mheno-types were observed in compound heterozygotes 
for the (GCG)9 mutation and a (®CG)7 allele found in 2% of the population, whereas 
homozygosity for the (GCG)7 allele leads to autosomal recessive OPMD. Thus the 
(GCG)7 allele is an example of a powmorphism which can act as either a modifier of a 
20 dominant phenotype or as a recessivOTnutation. Pathological expansions of the 
polyalanine tract may cause mutated P^B II oligomers to accumulate as filament 
inclusions in nuclei. 

In accordance with the present invention there is provided a human PAB II gene 
^^^ontaining a transcribed polymorahic GCG repeat, which comprises a sequence as set 
25 forth in Fig. 4, which includes intrc^ts and flanging genomic sequence. 

The allelic variants of GCG repeat of the human PAB II gene are associated with 
a disease related with protein accumulation in the nucleus, such as polyalanine 
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accumulation, or with a disease related with swallowing difficulties, such as 
oculopharyngeal muscular dystrophy. 

In accordance with the present invention there is also provided a method for the 
diagnosis of a dis-ease associated with protein accumulation in the nucleus, which 
5 comprises the steps of: 

a) obtaining a nucleic acid sample of said patient; and 

b) determining allelic variants of a GCG repeat of the human PAB II gene; thereby 
long allelic variants are indicative of a disease related with protein accumulation 
in the nucleus, such as polyalanine accumulation and oculopharyngeal muscular 

10 dystrophy. 

The long allelic variants have from about 245 to about 263 bp in length. 
In accordance with the present invention there is also provided a non-human 
mammal model for the human PAB II gene, whose germ cells and somatic cells are 
modified to express at least one allelic variant of the PAB II gene and wherein said 
15 allelic variant of the PAB II is being introduced into the mammal, or an ancestor of the 
mammal, at an embryonic stage. 

In accordance with the present invention there is also provided a method for the 
screening of thera-peutic agents for the prevention and/or treatment of oculopharyngeal 
muscular dystrophy, which comprises the steps of: 
20 a) administering the therapeutic agents to the non-human animal of the present 
invention or oculopharyngeal muscular dystrophy patients; and 
b) evaluating the prevention and/or treatment of development of oculopharyngeal 
muscular dystro-phy in this animal (such as a mammal) or in patients. 
In accordance with the present invention there is also provided a method to 
25 identify genes-products thereof, or part thereof, which interact with a biochemical 
pathway affected by the PAB II gene, which comprises the steps of: 
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a) designing probes and/or primers using the PAB II gene and screening 
oculopharyngeal muscular dystrophy patients samples with said probes and/or 
primers; and 

b) evaluating the role of the identified gene in oculopharyngeal muscular 
5 dystrophy patients. 

BRIEF DESCRIPTION OF THE DRAWINGS 

t Figs. 1 A- IB illustrate the positional cloning of the PAB II gene; 

<5^L-5: ^ Figs. 2A-2G illustrate the OP MD (GCG)n expansion sizes andse 

tioifsiteTSEQroNOS : 1 -2); 
10 Fig. 3 illustrates the age distribution of swallowing time (st) for French 

Canadian OPMD carriers of the (GCG)9 mutation; and 
Figs. 4A-4E illustrate the nucleoti de sequenc 
I)(SEQ ID NO:3). 



15 



DETAILED DESCRIPTION OF THE INVENTION 

In order to identify the gene mutated in OPMD, a 350 kb cosmid contig was 
constructed between flanking markers D14S990 and D14S1457 (Fig. 1A). Positions of 
the PAB II-selected cDNA clones were determined in relation to the EcoRI restriction 
map and the Genealogy-based Estimate of Historical Meiosis (GEHM)-derived 
candidate interval (Rommens, J.M. et al., in Proceedings of the third international 
20 workshop on the identification of tran-scribed sequences (eds. Hochgeschwender, U. & 
Gardiner, K.) 65-79 (Plenum, New York, 1994)). 

The human poly(A) binding protein II gene (PAB II) is encoded by the 
nucleotide sequence as set forth in Fig. 4. 

Twenty-five cDNA&^ere isolated by cDNA selection from the candidate <Q 
interval (Rommens, J.M. et aft, in Proceedings of the third international workshop on 
the identification of transcribe® sequences (eds. Hochgeschwender, U. & Gardiner, K.; 
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65-79; Plenum, New York, 199|). Three of these hybridized to a common 20 kb EcoRI 
restriction fragment and showedlhigh sequence homology to the bovine poly(A) binding 
protein II gene(bPAB II) (Fig. 1 A). The PAB II gene appeared to be a good candidate 
for OPMD because it mapped to me genetically defined 0.26 cM candidate interval in 
14ql 1 (Fig. 1 A), its mRNA showld a high level of expression in skeletal muscle, and 
the PAB II protein is exclu-sively localized to the nucleus (Krause, S. et al., Exp. Cell 
Res. 214, 75-82 (1994)) where it alts as a factor in mRNA polyadenylation (Whale, E., 
Cell 66, 759-768 (1991); Whale, E.fet al., J. Biol. Chem. 268, 2937-2945 (1993); 
Bienroth, S. et al., EN£BO J. 12, 5851594 (1993)). 

A 8 kb HindII%enomic fragment containing the PAB II gene was subcloned 
and sequenced (6002 bp\GenBank: AF026029)(Nemeth, A. et al., Nucleic Acids Res. 
23, 4034-4041 (1995)) (F%. IB). Genomic structure of the PAB II gene, and position of 
the OPMD (GCG)n expansions. Exons are numbered. Introns 1 and 6 are variably ^ ^ 

present in 60% of cDNA ckmes. ORF, open readying frame; cen, centromere and tel, ^ ^aY 5 ^ 
telomere. 

The coding sequenc<&vas based on the previously published bovine sequence 
(GenBank: X89969) and the ^quence of 31 human cDNAs and ESTs. The gene is 



comunosed of 7 exons and is i 



splice variants are found in 



iscribed in the center orientation (Fig. IB). Multiple 



ES1 



is and on Northern blots (Nemeth, A. et al., Nucleic 
20 Acids Res. 23, 4034-4041 (1993|). In particular, introns 1 and 6 are present in more 
than 60% of clones (Fig. 1B)( NSneth, A. et al, Nucleic Acids Res. 23, 4034-4041 
(1995)). The coding and protein sequences are highly conserved between human, bovine 
and mouse (GenBank: U93050). 93% of the PAB II sequence was read- 
ily amenable to RT-PCR- or genonmc-SSCP screening. No mutations were uncovered 
25 using both techniques. However, a ^00 bp region of exon 1 containing the start codon 
could not be readily amplified. This rlgion is 80% GC rich. It includes a (GCG)6 repeat 
which codes for the first six alanines o\a homopolymeric stretch of 10 (Fig. 2G). 
Nucleotide sequence of the mutated region of PAB II as well as the amino acid 
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sequences of the N-ferminus polyalanine stretch and position of the OPMD alanine 
insertions is also shoVn in Fig. 2. 

Special conditions were designed to amplify by PCR a 242 bp genomic 
fragment including this GCG-repeat. The (GCG)6 allele was found in 98% of French 
Canadian non-OPMD control chromosomes, whereas 2% of chromosomes carried a 
(GCG)7 polymorphism (n=86) (Brais, B. et al., Hum. Mol. Genet. 4, 429-434 (1995)). 

Screening OPMD cases belonging to 144 families showed in all cases a PCR 
product larger by 6 to 21 bp than that found in controls (Fig. 2 A). (GCG)6 normal allele 
(N) and the six different (GCG)n expansions observed in 144 families. 

Sequencing onthese fragments revealed that the increased sizes were due to 
"expansions of the GCG\epeat (Fig. 2G). Fig. 2F shows the sequence of the (GCG)9 
French Canadian expansion in a heterozygous par-ent and his homozygous child. 
Partial sequence of exon ltin a normal (GCG)6 control (N), a heterozygote (ht.) and a 
homozygote (hm.) for the (®CG)9-repeat mutation. The number of families sharing the 
different (GCG)n-repeats ex^nsions is shown in Table 1 . 
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Table 1 

Number of families sharing the different dominant (GCG)n OPMD mutations 





5 


Mutations 


Polyalaninef 


Families 






(GCG)8 


12 


4 






(GCG)9 


13 


99 






(GCG)10 


14 


19 




10 


(GCG)ll 


15 


16 


y3 




(GCG)12 


16 


5 


: = 




(GCG)13 


17 


1 






Total 




144 








t, 10 alanine residues in normal PAB II. 



20 



1 5 The (GCG)9 expansion shared by 70 French Cana-dian families is the most 

frequent mutation we observed (Table l).The (GCG)9 expansion is quite stable, with a 
single doubling observed in family Fl 51 in an estimated 598 French Canadian meioses 
(Fig. 2C). The doubling of the French Canadian (GCG)9 expansion is demonstrated in 
Family F151. 

This contrasts with the unstable nature of pr^^iously described disease-causing 
triplet-repeats (Rosenberg, R.N., New Eng. J. Med. 335, 1222-1224 (1996)). 

Genotyping of jftjl the participants in the clinical study of French Canadian 
"OPMD provided molecule" insights into the clinical variability observed in this 
condition. The genotypes rar both copies of the PAB II mutated region were added to an 
25 anonymous version of this clnical database of 176 (GCG)9 mutation carriers (Brais, B. 
et al., Hum. Mol. Genet. 4, 4ZW-434 (1995)). Severity of the phenotype can be assessed 
by the swaMpwing time (st) in\econds taken to drink 80 cc of ice-cold water (Brais, B. 
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et al., Hum. Mol. Genet. 4, 4^-434 (1995); Bouchard, J.-P. et al., Can. J. Neurol. Sci. 
19, 296-297 (1992)).The late Inset and progressive nature of the muscular dystrophy is 
clearly illustrated in heterozygous carriers of the (GCG)9 mutation (bold curve in 
Fig. 3) when compared to the average st of control (GCG)6 homozygous 
5 participants(n=76, thinner line imFig. 3). The bold curve represents the average OPMD 
st for carriers of only one copy olthe (GCG)9 mutation (n=169), while the thinner line 
corresponds to the average st for |GCG)6 homozygous normal coytrols(n=76). The 
black dot corresponds to the st vame.for individual VIII. Roman numerals refer to 
individual cases shown in Figs. 2eL 2D and discussed in the text. The genotype of a 
10 homozygous (GCG)9 patient and nkr parents is shown in Fig. 2B. Independent 
segregation of the (GCG)7 allele is^lso shown. Of note, case V has a more severe 
OPMD phenotype (Fig. 2D). 

Two groups of genotypftally distinct OPMD cases have more severe 
swallowing difficulties. Individuals I, II, and III have an early-onset disease and are 
homozygous for the (GCG)9 expansion (P < 10-5) (Figs. 2B, F). Cases IV, V, VI and 
VII have more severe phenotypes and are compound heterozygotes for the (GCG)9 
mutation and the (GCG)7 polymorphism (P < 10-5). In Fig. 2D the independent 
segregation of the two alleles is shovm. Case V, who inherited the French Canadian 
(GCG)9 mutation and the (GCG)7 polymorphism, is more symptomatic than his brother 
20 VIII who carries the (GCG)9 mutationVd a normal (GCG)6 allele (Figs. 2D and 3). 
The (GCG)7 polymorphism thus appeaA to be a modifier of severity of dominant 

A \ ^ 

OPMD. Further-more, the (GCG)7 alleleTcan act as a recessive mutation. This was 
documented in the French patient IX who inherited two copies of the (GCG)7 
polymorphism and has a late-onset autosomal recessive form of OPMD (Fig. 2E). Case 
25 IX, who has a recessive form of OPMD, is s\own to have inherited two copies of the 
(GCG)7 polymorphism. 

This is the first description of&hort trinu-cleotide repeat expansions causing a 
' human disease. The addition of only two GCG repeats is sufficient to cause dominant 
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OPMD. OPMD expansions do not slfere the cardinal features of "dynamic mutations". 
The GCG expansions are not only short they are also meiotically quite staj>le. 
Furthermore, there is a clear cut-off between the normal and abnormal alleles, a single 
GCG expansion causing a recessive phlnotype. The PAB II (GCG)7 allele is the first 
5 example of a relatively frequent allele A^iich can act as either a modifier of a dominant 
phend^ype or as a recessive mutation, lliis dosage effect is reminiscent of the one 
observed in a homozygote for two dominant synpolydactyly mutations. In this case, the 
patient had more severe deformities bec^ise she inher-ited two duplications causing 
an expansion in the polyalanine tract of tffe HOXD13 protein (Akarsu, A.N. et al., Hum. 
10 Mol. Genet. 5, 945-952 (1996)). A duplicMion causing a similar polyalanine expansion 
in the a subunit 1 gene of the core-binding nranscription factor (CBF(l) has also been 
found to cause dominant cleido/cranial dysplasia (Mundlos, S. et al., Cell 89, 773-779 
(1997)). The mutations in these two rare diseases are not triplet-repeats. The are 
duplications of "cryptic repeats" composed o| mixed synonymous codons and are 
15 thought to result from unequal crossing over foVarren, S.T., Science 275, 408-409 
(1997)). In the case of OPMD, slippage durinffireplication causing a reiteration of the 
GCG codon is a more likely mechanism (Well| D.R., J. Biol. Chem. 271, 2875-2878 
(1996)). 

Different observations^onverge to suggest that a gain of function of PAB II 
ay cause the accumulation of nuclear filaments observed in OPMD (Tome, F.M.S. & 
Fardeau, Acta Neuropath. 49, 85|87 (1980)). PAB II is found mostly in dimeric and 
oligomeric forms (Nemeth, A. etfal., Nucleic Acids Res. 23, 4034-4041 (1995)). It is 
possible that the polyalanine tractrolays a role in polymerization. Polyalanine stretches 
have been found in many other nullear proteins such as the HOX pro-jteins, but their 
function is still unknown (Davies, Jw. et al., Cell 90, 537-548 (1997)). Alanine is a 
highly hydrophobic amino acid present in the cores of proteins. In dragline spider silk, 




T 



25 



polyalanine stretches are thought to rarm B-sheet structures important in ensuring the £) 
fibers' strength (Simmons, A.H. et al.,tt>cience 271, 84-87 (1996)). Polyalanine 
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oligomers have also been shewn to be extremely resistant to chemical denaturation and 
enzymatic degradation (Foro©d, B. et al., Bioch. and Biophy. Res. Com. 211,7-13 
(1995)). One can speculate that PAB II oligomers comprised of a sufficient number of 
mutated molecules might acci&nulate in the nuclei by forming undegradable polyalanine 
rich macromolecules. The ratepf the accumulation would then depend on the ratio of 
mutated to non-mutated protein. The more severe phenotypes observed in homozygotes 
for the (GCG)9 mutations and lompound heterozygotes for the (GCG)9 mutation and 
(GCG)7 allele may correspond Sp the fact that in these cases PAB II oligomers are 
composed only of mutated proteins. The ensuing faster filament accumulation could 
10 cause accelerated cell death. The\recent description of nuclear filament inclusions in 
Huntington's disease, raises the p&ssibility that "nuclear toxicity" caused by the 
accumulation of mutated homopowmeric domains is involved in the molecular 
pathophysiology of other triplet-regeat diseases (Davies, S.W. et al., Cell 90, 537-548 
(1997); Scherzinger, E. et al., Cell lo, 549-558 (1997); DiFiglia, M. et al, Science 277, 
15 1990-1993 (1997)). Future immunolytochemical and expression studies will be able to 
test this patho-physiological hypoth&is and provide some insight into why certain 
muscle groups are more affected whi^ all tissues express PAB II. 

Methods 
Contig and cDNA selection 

20 The cosmid contig was constructed by standard cosmid walking techniques 

using a gridded chromosome 14-specific cosmid library (Evans, G.A. et al., Gene 79, 9- 
20 (1989)). The cDNA clones were isolated by cDNA selection as previously described 
(Rommens, J.M. et al., in Proceedings of the third international workshop on the 
identification of transcribed sequences (eds. Hochgeschwender, U. & Gardiner, K.) 65- 

25 79 (Plenum, New York, 1994)). 
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£loning of the PAB II gene 

Three cDNA clones corresponding to PAB II were sequenced (Sequenase, 
SB). Clones were verified to map |p cosmids by South-ern hybridization. The 8 kb 
Hindlll restriction frag-ment was su&cloned from cosmid 166G8 into pBluescriptn (SK) 
5 (Stratagene). The clone was sequenced using prim-ers derived from the bPABII gene 
and human EST sequences. Sequencing of the PAB II introns was done by primer 
walking. 




I II mutation screening &nd sequencing 

All cases were diagnosed as having OPMD on clinical grounds (Brais, B. et al., 
Hum. Mol. Genet. 4, 429-434 \ 1995)). RT-PCR- and genomic SSCP analyses were 
done using stan-dard protocoll(Lafreniere, R.G. et al., Nat. Genet. 15, 298-302 (1997)). 
:5idk_^^The primers used to amplify th© PAB II mutated i 

CGCAGTGCCCCGCCT'£Ae7^3 T (SEQ ID NO:4) and 5'- 
AeAAeArGGCGCCGCCGCC^CGGC-S' (SEQ ID NO:5). PCR reactions were 
15 performed in a total volume of IS |il containing: 40 ng of genomic DNA; 1 .5 ^g of 
BSA; 1 nM of each primer; 250 Sm dCTP and dTTP; 25 \iM dATP; 125 nM of dGTP 
and 125 jiM of 7-deaza-dGTP (PlLmacia); 7.5% DMSO; 3.75 nCi[ 35 S]dATP, 1.5 unit 
of Taq DNA polymerase and 1.5 mM MgC12 (Perkin Elmer). For non-radioactive PCR 
reactions the [ 35 S]dATP was replaced by 225 \iM of dATP. The amplification procedure 
20 consisted of an initial denatuurationlstep at 95°C for five minutes, followed by 35 cycles 
of denaturation at 95 °C for 15 s, ann^ling at 70°C for 30 s, elongation at 74°C for 30 s 
and a final elongation at 74°C for 7 nJ|n. Samples were loaded on 5% polyacrylamide 
denaturing gels. Following electroph&esis, gels were dried and autoradiographs were 
obtained. Sizes of the inserts were detemiined by comparing to a standard M13 
25 sequence (Sequenase™, USB). Fragments used for sequencing were gel-purified. 



^Sequencing uf the inu l d t e thfragment usinguhe Amplicycfe kit Am (Ferkin Mi fflei) was - 
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10 



15 



done with the S'-CGCBlGTG CCCCGCCTTAGAGGTG- ^' f sf.o m NO fi) primai^-^ 
5iftemperaturl of 68°C. 

Stability of (GCG)-repeat expansions 

The meiotic stability of tire (GCG)9-repeat was estimated based on a large 
French Canadian OPMD cohort. Iffliad been previously established that a single 
ancestral OPMD carrier chro-mosome was introduced in the French Canadian 
population by three sisters in 1648. Seventy of the seventy one French Canadian OPMD 
families tested to date segregate a (0|CG)9 expansion. However, in family F151, the 
affected brother and sister, despite sharing the French Canadian ancestral haplotype, 
carry a (GCG)12 expansion, twice thelsize of the ancestral (GCG)9 mutation (Fig. 2C). 
In this founder effect study, it is estim&ed that 450 (304-594) historical meioses shaped 
the 123 OPMD cases belonging to 42 ofithe 71 enrolled families. The screening of the 
full set of participants allowed an identification of another 148 (GCG)9 carrier 
chromosomes. Therefore, it is estimated fifrat a single mutation of the (GCG)9 expansion 
has occurred in 598 (452-742) meioses. 



Genotype-phenotype correlations 

176 carriers of at least one copy of the (GCG)9 mutation were examined during 
the early stage of the linkage study. All were asked to swallow 80 cc of ice-cold water 
as rapidly as possible. Testing was stopped after 60 seconds. The swallowing time (st) 

20 was validated as a sensitive test to identify OPMD cases (Brais, B. et al, Hum. Mol. 
Genet. 4, 429-434 (1995); Bouchard, J.-P. et al., Can. J. Neurol. Sci. 19, 296-297 
(1992)). The st values for 76 (GCG)6 homozygotes normal controls is illustrated in Fig. 
3. Analyses of variance were computed by two-way ANOVA (SYSTAT package). For 
the (GCG)9 homozygotes their mean st value was compared to the mean value for all 

25 (GCG)9 heterozygotes aged 35-40 (P < 10' 5 ). For the (GCG)9 and (GCG)7 compound 
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heterozygotes their mean st value was compared to the mean value for all (GCG)9 
heterozygotes aged 45-65 (P < 10" 5 ). 



'embodiments thereof, it willroe understood that it is capable of further modifications 



5 and this application is intend™ to cover any vari-ations, uses, or adaptations of the 
invention follow-ing, in 'genera, the principles of the invention and including such 
departures from the present disclosure as come within known or customary practice 
within the art to which the invention pertains and as may be applied to the essential 
features hereinbefore set forth, am as follows in the scope of the appended claims. 




While the inventionlhas been described in conZ-nection with specific 



